ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth Estimation
نویسندگان
چکیده
Depth estimation is a crucial step for 3D reconstruction with panorama images in recent years. Panorama maintain the complete spatial information but introduce distortion equirectangular projection. In this paper, we propose an ACDNet based on adaptively combined dilated convolution to predict dense depth map monocular panoramic image. Specifically, combine kernels different dilations extend receptive field Meanwhile, adaptive channel-wise fusion module summarize feature maps and get diverse attention areas along channels. Due utilization of constructing module, network can capture leverage cross-channel contextual efficiently. Finally, conduct experiments three datasets (both virtual real-world) experimental results demonstrate that our proposed substantially outperforms current state-of-the-art (SOTA) methods. Our codes model parameters are accessed https://github.com/zcq15/ACDNet.
منابع مشابه
Aperture Supervision for Monocular Depth Estimation
We present a novel method to train machine learning algorithms to estimate scene depths from a single image, by using the information provided by a camera’s aperture as supervision. Prior works use a depth sensor’s outputs or images of the same scene from alternate viewpoints as supervision, while our method instead uses images from the same viewpoint taken with a varying camera aperture. To en...
متن کاملMonocular Depth Estimation with Hierarchical Fusion of Dilated CNNs and Soft-Weighted-Sum Inference
Monocular depth estimation is a challenging task in complex compositions depicting multiple objects of diverse scales. Albeit the recent great progress thanks to the deep convolutional neural networks (CNNs), the state-of-the-art monocular depth estimation methods still fall short to handle such real-world challenging scenarios. In this paper, we propose a deep end-to-end learning framework to ...
متن کاملDepth Estimation Using Monocular and Stereo Cues
Depth estimation in computer vision and robotics is most commonly done via stereo vision (stereopsis), in which images from two cameras are used to triangulate and estimate distances. However, there are also numerous monocular visual cues— such as texture variations and gradients, defocus, color/haze, etc.—that have heretofore been little exploited in such systems. Some of these cues apply even...
متن کاملBayesian depth estimation from monocular natural images.
Estimating an accurate and naturalistic dense depth map from a single monocular photographic image is a difficult problem. Nevertheless, human observers have little difficulty understanding the depth structure implied by photographs. Two-dimensional (2D) images of the real-world environment contain significant statistical information regarding the three-dimensional (3D) structure of the world t...
متن کاملQualitative Estimation of Depth in Monocular Vision
In this paper we propose two techniques to qualitatively estimate distance in monocular vision. Two kinds of approaches are described, the former based on texture analysis and the latter on histogram inspection. Although both the methods allow only to determine whether a point within an image is nearer or farther than another with respect to the observer, they can be usefully exploited in all t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i3.20278